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Abstract. We analyze the eigenvalues of the adjacency matrices of a wide 
variety of random trees. Using general, broadly applicable arguments based 
on the interlacing inequalities for the eigenvalues of a principal submatrix of 
a Hermitian matrix and a suitable notion of local weak convergence for an 
ensemble of random trees that we call probability fringe convergence, we show 
that the empirical spectral distributions for many random tree models converge 
to a deterministic (model dependent) limit as the number of vertices goes to 
infinity. 

Moreover, the masses assigned by the empirical spectral distributions to 
individual points also converge in distribution to constants. We conclude for 
ensembles such as the linear preferential attachment models, random recursive 
trees, and the uniform random trees that the limiting spectral distribution has 
a set of atoms that is dense in the real line. We obtain lower bounds on the 
mass assigned to zero by the empirical spectral measures via the connection 
between the number of zero eigenvalues of the adjacency matrix of a tree and 
the cardinality of a maximal matching on the tree. In particular, we employ a 
simplified version of an algorithm due to Karp and Sipser to construct maxi- 
mal matchings and understand their properties. Moreover, we show that the 
total weight of a weighted matching is asymptotically equivalent to a constant 
multiple of the number of vertices when the edge weights are independent, iden- 
tically distributed, non-negative random variables with finite expected value, 
thereby significantly extending a result obtained by Aldous and Steele in the 
special case of uniform random trees. 

We greatly generalize a celebrated result obtained by Schwcnk for the uni- 
form random trees by showing that if any ensemble converges in the probabil- 
ity fringe sense and a very mild further condition holds, then, with probability 
converging to one, the spectrum of a realization is shared by at least one other 
(non-isomorphic) tree. 

For the the linear preferential attachment model with parameter a > — 1, 
we show that for any fixed k the k largest eigenvalues jointly converge in 
distribution to a non-trivial limit when rescaled by n^^'^'''^ , where 7^ = a + 2 
is the Malthusian rate of growth parameter for an associated continuous time 
branching process. 
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1. Introduction 

The study of large random matrices and their eigenvalues is one of the primary 
themes of current research in probability. It finds applications in such diverse 
fields as number theory, random partitions, free probability and operator algebras, 
high-dimensional statistical analysis, nuclear physics, signal processing, wireless 
communication, quantum percolation, and the operation of search engines. Some 
recent book length expositions are |Dei99| |BI0T| IMeh04| ITV04| IGui09| lEb?09) . 

The objects of interest in this field are usually random real symmetric or complex 
Hermitian matrices. For example, one of the most popular models is the Gauss- 
ian unitary ensemble (GUE), where the matrices are Hermitian, the entries above 
the diagonal are independent, identically distributed, standard complex Gaussian 
random variables, and the entries on the diagonal are independent, identically dis- 
tributed, standard real Gaussian random variables. Much is now known about the 
asymptotic behavior of objects such as the empirical distribution of the eigenvalues 
and the behavior of the maximal eigenvalue. 

Here we investigate random matrices with substantially greater structure and 
complexity than the GUE, namely the adjacency matrices of random graphs, al- 
though our methods are also applicable to the closely related Laplacian matrices. 
The recent availability of large amounts of data has led to an explosion in the num- 
ber of models used to model real- world networks, and dynamically grown models 
such as various preferential attachment schemes have attracted significant interest 
from the computer science and mathematical physics community. It is known (see, 
for example, the monographs [Chu97l IGRnil |Big93[ IGR,S97I ICDS95I ^DGT88j) 
that a surprising diversity of features of a graph are determined, at least in part, 
by the behavior of the eigenvalues of its adjacency and Laplacian matrices. 

We concentrate on the adjacency matrices of various ensembles of random 
trees. Random trees arise in numerous contexts, ranging from the analysis of 
database and search algorithms in computer science to models of phylogenies 
(that is, evolutionary family trees) in biology. Moreover, many of the preferen- 
tial attachment schemes for networks are also largely random models of grow- 
ing trees (see, for example, [B ha07) for a survey of some of the more popular 
schemes). We note that, although trees are rather simple graphs, the analysis of 
their eigenvalues is still rather challenging, and even in the case of highly symmet- 
ric deterministic trees explicit formulae for spectra have only been found recently 
[RojOSt .RROTal |Roj07[ IRR07bl |Roj06a[ [Roj06b, RS05J . 

We introduce a general technique based on the concept of probability fringe con- 
vergence for showing that the spectral distributions (that is, empirical distributions 
of the eigenvalues) of the adjacency matrices of an ensemble of random trees con- 
verge in the topology of weak convergence of probability measures on the line to a 
deterministic limit as the number of vertices goes to infinity, and we show how this 
technique applies in several natural examples. 

The notion of probability fringe convergence is a type of local weak convergence 
for random graphs that involves the convergence in distribution of suitably defined 
neighborhoods of a vertex picked uniformly from a random graph as the size of the 
graph goes to infinity. Surveys of this general methodology are |Ald91ai IAS04| . 
Such convergence results for random trees where the limit is described in terms 
of a continuous-time branching processes go back to |JN84[ INJ84| . The first (to 
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our knowledge) use of such techniques in various general models of preferential 
attachment is |RTV07j . Such notions arc further explored in [Bha07| . 

The key algebraic element of our proof of convergence of the spectral distribu- 
tions is the set of interlacing inequalities between the eigenvalues of a Hermitian 
matrix and the eigenvalues of one of its principal sub-matrices. The interlacing 
inequalities allow us to break a large tree up into a forest of smaller trees by delet- 
ing a small proportion of edges and conclude that the spectral distribution of the 
tree is close to that of the forest which, in turn, is a convex combination of the 
spectral distributions of its component sub-trees. If the decomposition into a forest 
is done appropriately, then the resulting sub-trees are "locally defined" in a sense 
that allows probability fringe convergence to be brought to bear to show that the 
spectral distribution of the forest converges. 

We note that interlacing has found other applications in algebraic graph theory 
p70l[Fi599llHae95| . 

Another interesting technical aspect of our work is that the method of moments, 
one of the most commonly used tools in random matrix theory, fails for some natural 
ensembles because, as we observe in Subsection 14.11 expected values of moments of 
the spectral distribution go to infinity. 

While our method for showing that the spectral distribution converges is quite 
general, it does not provide any sort of characterization of the limiting distribution. 
In Section [8] we look at an extremely simple random tree that is obtained by taking 
the tree consisting of a path of n points and independently connecting an edge to 
each point with equal probability, so that the resulting tree resembles a comb with 
some of its teeth missing. Our probability fringe convergence methodology does not 
apply immediately to this ensemble of random trees, but a straightforward modifi- 
cation of it does. We compute the asymptotic moments of the spectral distribution 
for this ensemble and show that even in this simple case the answer is rather intri- 
cate, indicating that we should perhaps not expect simple characterizations of the 
limit for more complex models. 

Quite closely related to our results for the spectral distribution is the recent 
work |BL07| . where similar local weak convergence techniques are combined with 
Stieltjes transform methods to prove various limiting results for families of random 
graphs. 

We extend our results on the convergence of the spectral distribution in two 
different directions. 

First, we show for any 7 € R that the proportion of eigenvalues that have the 
value 7 converges to a constant under the assumption of probability fringe conver- 
gence. Moreover, we give a simple sufhcient condition for the limit to be positive 
and apply this condition to show for several models that the limiting spectral dis- 
tribution has a set of atoms that is dense in R. We pay particular attention to the 
proportion of zero eigenvalues, a quantity of importance in areas such as quantum 
percolation [BGOOi IBGOl] . It is possible to obtain much more exact information on 
the limiting proportion because of the connection between the number of zero eigen- 
values of the adjacency matrix of a tree and the cardinality of a maximal matching. 
In particular, we use a simplified version of the Karp-Sipser algorithm |KS81j to 
construct maximal matchings. We also use our methods to obtain the asymptotic 
behavior of the total weight of a maximal weighted matching when the edge weights 
are given by independent, identically distributed, non-negative random variables. 
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Second, we obtain results on the joint convergence in distribution of the suitably 
normalized k largest eigenvalues for the preferential attachment tree. These results 
extend and sharpen those in |CLV03b[ [CLV03a[ IFFFOSi IFFF03j . where it was shown 
that the k largest eigenvalues are asymptotically equivalent to the square roots of 
the k largest out-degrees. The weak convergence of the suitably rescaled maximum 
out-degree was obtained in [M6r05J using martingale methods. However it is not 
clear how to extend this technique to describe the asymptotics for the k largest 
out-degrees for k > 2. We prove our more general results using an approach that 
is essentially completely different. 

2. Some representative random tree models 

An enormous number of random tree models have been developed by computer 
scientists working on the analysis of algorithms and the mathematical modeling 
of real world networks: see jAld91a[ IBha07j for a description of some of the more 
popular models. Although our methods apply quite generally, it will be useful to 
have the following models in mind when it comes to checking how the hypotheses 
of our results may be verified in particular instances. 

Random recursive tree: This is the simplest model of constructing a rooted tree 
sequentially via the addition of a new vertex at each stage. Start with a single vertex 
(the root) at time 1. Label the vertex added at stage n by n, so the tree 7^ that 
has been constructed by stage n has vertex set [n] :— {1,2, ... ,ri}. Construct the 
tree at stage n -I- 1 by adding an edge from vertex n -I- 1 to a vertex chosen uniformly 
among the vertices 1,2, ... ,n. We refer the reader to |SM94| for a survey of some 
of the properties of the random recursive tree. 

Linear preferential attachment tree: This is another sequential construction. 
As before, start with a single vertex (the root) at time 1. Suppose the tree on n 
vertices labeled by [n] has been constructed. Think of the edges as directed away 
from the root and let D{v,n) be the out-degree of vertex v G [n] at time n (that 
is, D(v,n) is the number of children of vertex v at time n). Construct a tree on 
n + 1 vertices via the addition of an edge between the new vertex n + 1 and the 
vertex v in [n] with probability proportional to D(v,n) -I- 1 -I- a, where a > —1 is 
a parameter of the process. There is an enormous amount of recent literature on 
this model. We refer the reader to [BR03|, IDur07|, IBha07j for relevant references. 

Uniform random rooted unordered labeled tree: By Cayley's theorem, there 
are n"~^ rooted trees on n labeled vertices (we think of trees as abstract graphs and 
so we don't consider a particular embedding of a tree in the plane when it comes 
to deciding whether two trees are "the same" or "different" - this is the import of 
the adjective "unordered"). Choose one of these trees uniformly at random. Since 
we are interested only in the structure of the tree, the labeling will be irrelevant. 

Random binary tree: There are various models of random rooted binary trees. 
The one we shall consider is the following sequential construction. Start at time 1 
with the three vertex tree consisting of a root and two leaves. At each stage, choose 
a leaf uniformly and attach two new leaves to it by two new edges. 
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3. Probability fringe convergence of random trees 

The key to understanding the asymptotic properties of the spectra of random 
trees such as those introduced in Section [2] is that they converge "locally" to appro- 
priate locally finite infinite trees. We define the relevant notion of local convergence 
in this section, and then show how it applies to the models of Section [2] 

We first need to be precise about what we mean by the term finite rooted tree. 
So far, we have talked about trees as particular types of graphs. That is, we have 
thought of a tree as being described by a finite set of vertices and a finite set of 
edges that are unordered pairs of vertices. A rooted tree has then been defined as 
such an object with a particular distinguished vertex that we call the root. This 
point of view is useful for describing constructions of random trees. However, we 
will often wish to consider two trees as being the same if they are isomorphic in the 
usual graph-theoretic sense: that is, if they have the same shape and only differ by 
a labeling of the vertices. A tree in this latter sense is thus an isomorphism class 
of trees thought of as graphs. When we wish to distinguish these two notions we 
will use standard terminology and speak of labeled and unlabeled trees, respectively. 
Continuing in this vein, we take two rooted trees (thought of as graphs) to be the 
same if there is a graph-theoretic isomorphism from one to the other that preserves 
the root, and we call the corresponding equivalence classes unlabeled rooted trees. 
Even more generally, we may consider unlabeled trees with several distinguished 
vertices. 

Let T be the countable space of all finite unlabeled rooted trees. Set T, = TU{*}, 
where * is an adjoined point. Equip T and T» with the respective discrete topologies, 
and equip the Cartesian products T°° and with the usual product topologies. 

Consider a finite unlabeled rooted tree t G T with root p and another distin- 
guished vertex v that is at distance h from the root (v may coincide with p, in 
which case h ^ 0). Let {v — vq,vi, . . . ,Vh = p) denote the unique path from the 
vertex v to the root. Write t^ for the subtree rooted aX vq = v that consists of 
all vertices for which the path to the root passes through wq, and for 1 < fc < /i, 
write tk for the subtree rooted at Vk that consists of all vertices for which the 
path from the root passes through Vk but not through Vk-i. Write $(t, •) for the 
probability distribution on TJ° that places mass {^t)^^ at each of the sequences 
{to, ti, ... ,tfi, *,*,.. .) G TJ° as V ranges over the #t vertices of t. It is clear that 
$ is a probability kernel from T to TJ°. 

Definition 3.1. Let be a sequence of random finite unlabeled rooted trees, 

and suppose that T is a T°°-valued random variable. The sequence con- 
verges in the probability fringe sense to T if the sequence $(7^, ■) of random prob- 
ability measures on converges weakly to the distribution of T in the topology 
of weak convergence of probability measures on T J° . 

Remark 3.2. The definition requires that the empirical distribution of the sub-trees 
below the various vertices of 7^ converges. However, it demands much more than 
this: for each fc > 1, the joint empirical distribution of the sub-tree below a vertex 
and the sub-trees below each of its k most recent ancestors must also converge. 

Remark 3.3. Note that any sequence {tQ,ti,...) S T°° may be thought of as a 
locally finite unlabeled rooted tree with one end (that is, with a single semi-infinite 
path) via the identification of the roots of ifc , A; G Z+ , as the successive vertices on 
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the unique semi-infinite path from the root. We call such trees sin-irees (for single 
infinite path trees). 

Remark 3.4. The terminology "probability fringe convergence" is not standard. In 
the literature, the convergence of the local structure around a uniformly chosen 
vertex of 7n to the structure around the root for some limiting random sin-tree 
is an instance of what has been termed "local weak convergence" by Aldous, see 
|AS04j . Our definition is somewhat stronger. 

A powerful technique for establishing probability fringe convergence of an en- 
semble of random trees is to first show that each member of the ensemble can be 
constructed as the family tree of a suitable stopped continuous-time branching pro- 
cess. (For us, a continuous-time branching process is the sort of object considered 
in |Jag89| : individuals give birth to a possibly random number of offspring at the 
arrival times of a point process up to a possibly infinite death time, and those 
offspring go on to behave as independent copies of their parent.) The next result 
describes such cmbcddings for the ensembles of Section [21 

Proposition 3.5. (a) [Random recursive tree] Consider a continuous time branch- 
ing process that starts with a single progenitor, individuals live forever, and individ- 
uals produce a single offspring at each arrival time of a unit rate Poisson process 
(this process is sometimes called the Yule process, but the usage of that terminology 
is not completely consistent in the literature) . Write J'{t) G T for the corresponding 
family tree at time t>Q. Set Tn '■= inf{t > : ft^J-{t) = n}. Then J-(Tn) has the 
same distribution as Tn, where Tn is the random recursive tree on n vertices. 

(b) [Linear preferential attachment tree] Consider a continuous time branching pro- 
cess that starts with a single progenitor, individuals live forever, and the point pro- 
cess representing the offspring distribution of any individual is a pure birth point 
process started at that can be described as follows: Whenever any individual has 
already given birth to k direct offspring, the individual produces a new offspring 
at rate fc -|- 1 -|- a. In particular, at the time an individual is born, the individ- 
ual generates new offspring at rate 1 -\- a. Thus, the times that elapse between the 
birth of an individual and the successive births of the individual's offspring, say 
{Pit02, ■■ ■), may be written as (3i = X]j=o'?i' where the successive rjj are indepen- 
dent exponential random variables and rjj has rate j -|- 1 -|- a. Each individual in 
the population has its own independent and identically distributed copy of the above 
offspring point process. Write J'{t) G T for the corresponding family tree at time 
< > 0. Set Tn '■— inf{t > : =^J-{t) = n}. Then, !F{Tn) has the same distribution as 
Tn, where Ti is the linear preferential attachment tree on n vertices with parameter 
a > -1. 

(c) [Uniform random rooted unordered labeled tree] Let Zoo be the complete family 
tree for a (discrete-time) Galton- Watson branching process with mean 1 Poisson 
offspring distribution. Note that Zoo is finite almost surely. The distribution of 
Zoo conditioned on f^Zoo — n is the same as that of Tn, where Ti is the objected 
obtained by taking the uniform random rooted unordered tree on n labeled vertices 
and removing the labeling. 

(d) [Random binary tree] Consider a continuous-time branching process that starts 
with a single progenitor, individuals live until a rate 1 exponential time, at which 
time they produce two offspring ( we will refer to this process as the random binary 
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splitting process^. Write T{t) G T for the corresponding family tree at time t>Q. 
Set Tn '■— inf{i > : ^J-'(t) = n}. Then, !F{Tn) has the same distribution as Tn, 
where Tn is the random binary tree on n vertices. 

Proof. Parts (a) , (b) and (d) follow from the comparison of the rates of the produc- 
tion of the offspring and the corresponding growth dynamics of the associated tree 
Tn . Part (c) is well-known and follows from randomly ordering the offspring of each 
individual to obtain an ordered (that is, planar) tree, computing the conditional 
probability distribution of the resulting rooted ordered tree, randomly labeling the 
vertices of the rooted ordered tree, and verifying that the randomly labeled tree 
is uniformly distributed using Cayley's theorem for the number of rooted labeled 
trees on n vertices (see, for example, (Ald91bj ). □ 

We now describe briefly the limiting sin-trees for the models considered above. 
Recall that a sin-tree can be thought of as an element of T°° . The following 
proposition follows from well-known results, and we give the appropriate references 
for each specific construction. 

Proposition 3.6. Each of the four ensembles of Section\^ converges in the prob- 
ability fringe sense, (as defined in Definition \3.1]) . The limiting random sin-tree 
for each model is described explicitly in Construction 

Construction 3.7. (a) [Random recursive tree: [JN84| INJ841 IAld91a| ] Let 

be independent rate one Yule processes. Let Xo,Xi,... be independent rate 1 
exponential random variables and put Si — '^^j^^Xj. Then, the limiting sin-tree 
has the distribution of {^i{Si))°^Q. 

(b) [Linear referential attachment: |NJ84i IJN841 IBhaOTj ] Let (Xi)J^o be indepen- 
dent exponential random variables, where Xq has rate 2-\-a and each Xi, i > 0, has 
rate l-\-a. Let be continuous time branching process that are conditionally 
independent given {Xi)°^Q, with the conditional distribution of J^i being that in 
part (b) of Proposition 13.51 subject to the minor modifications that the point pro- 
cess describing the times at which the root individual gives birth is conditioned to 
have a birth at time Xi and the offspring born at this time and all its descendants 
are removed from the population. All other vertices give birth to according to the 
original offspring point process. Then, the limiting sin-tree has the distribution of 

(•^.(e;=o^.)).=o- 

(c) [Uniform random tree: |Gri81j ] The limiting sin-tree has the distribution of an 
infinite sequence of independent copies of the critical Poisson Galton- Watson tree 
Zoo of part (c) of Proposition 13.51 

(d) [Random binary tree: [AldQlaj ] Let be independent random binary 
splitting processes as in part (d) of Proposition [575l Let {Xi)°^Q be independent rate 
1 exponential random variables and set Si — '^^j^qXj. Define T- valued random 
variables (Z^i)^o follows. Put Uo ~ J-o{So). For i > 1, Ui is constructed by 
attaching a new vertex pi to the root of !Fi{Si-i) and re-rooting the resulting tree 
at Pi. Then, the limiting sin-tree has the distribution of (Ui)f^Q. 
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Figure 1. Empirical distribution for the positive eigenvalues of 
the random recursive tree with 200 vertices, averaged over 200 
realizations. 



4. Statement of results 



4.1. Convergence of the spectral distribution. 

Theorem 4.1. Suppose that ('7^i)5^x sequence oj random finite unlabeled rooted 
trees that converges in the probability fringe sense to a sin-tree T. Let Fn denote 
the spectral distribution of the adjacency matrix of Tn ■ Then there exists a ( model 
dependent) deterministic probability distribution F such that F„ converges in dis- 
tribution to F in the topology of weak convergence of probability measures on R. 

Simulations of the expected value the spectral distribution for various finite 
random trees are shown in Figure [H Figure [2l and Figure [3l 

Remark 4.2. Recall that the graph Laplacian of a tree t with adjacency matrix A 
is the matrix A — D, where D is the diagonal matrix recording the degrees of the 
vertices of t (we caution the reader that some authors refer to the negative of this 
matrix as the Laplacian). The methods we use to establish Theorem 14.11 can also 
be used to show that if the sequence converges in the probability fringe 

sense, then the spectral distribution of the Laplacian matrix of Tn converges in 
distribution to a deterministic probability distribution on R. 



Remark 4.3. One of the interesting aspects of our proof of Theorem 14. II is that it 
does not employ the usual work-horse of random matrix theory, namely the method 
of moments. A standard method for establishing convergence of the spectral distri- 
bution f „ of an n X n random matrix An to a deterministic probability distribution 
F as n — > oo is to show for all fc > 1 that 
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Figure 2. Empirical distribution for the positive eigenvalues of 
the preferential attachment tree with 100 vertices, averaged over 
200 realizations. 




Figure 3. Empirical distribution for the positive eigenvalues of 
the random binary tree with 401 vertices, averaged over 100 real- 
izations. 



where the constants {ak)'kLi the moments of a unique distribution and then 
show that 



lim E 
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The following result shows that this strategy cannot be used for linear preferential 
attachment model with parameter a — and indicates why it is necessary for us to 
develop other techniques. 

Lemma 4.4. Let An be the adjacency matrix of the linear preferential attachment 
tree with a — 0. Then 



hm E 



oo. 



Proof. Recall that D(k, n) is the out-degree of the vertex fc in 7^ and that the vertex 
n + 1 is connected to the vertex k with probability proportional to D{k,n) + 1. 

Let I be the (random) vertex amongst the vertices {1,2, . . . to which the 
vertex (n + 1) is attached. Because n + 1 is a leaf in Tn+i, every path in 7^i+i 
from n + 1 to itself of length 4 is of the form n+l^T-^i^T^n+lior 
some ie{l,2,...,n + l} \ {T}, so that every path of length 4 in T^^+i from n + 1 
to itself corresponds to a unique path of length 2 in 7^i+i from 2 to itself. Thus, 
= A2_^i(X,X). Also, A2+i(J, J) = ^2 (1,1) + 1 because the paths 
of length 2 in %i+i from T to itself consist of the paths of length 2 in %i from T to 
itself plus the additional pathX n+1 T. Hence, A'^^i{n+1, n+1) > A^(X,X). 

Therefore, 

tr{At,+,) - tr(Afj > Af,+,{n + l,n+l)> Al(I,I) = D{I,n) + l^j^.y, 
and so, setting C„ = X]fe=i D{k, n)^ and using the fact that X]fc=i ^{^j ^) = n — 1, 

EMAi^,)] ~ E[tr(^fJ] > E[DiI,n)] ^ iJ(fc, n) ^^^^^"^ + ^ 

k=l 

= — - — mCJ + > — EfCJ. 

2n - 1 ^ 2n - 1 - 2n ^ ^ 

Now, 

n n 

—^EMAt,^,)] = ^J2 (E[tr(^f„+i)] -E[tr(ylfJ]) > E ^^^t^™]' 

m— 1 m— 1 

and so it will suffice to show that 

(4.1) lim -E[Cn] = oo. 

n — >oo fi 

Note that 

E[C„+i] - E[Cn] = E[D(n + 1, n + 1)^] + E[(l + D{I, n)f - D{I, nf] 



1 + 2E[D{I, n)] > 1 + -E[Cn] 



n 



and hence 



It follows that 



n 

E[C„+i] >n+y - 



m 

m—l 



liminfS>i + i,,i,fH^ 



and so (|4.ip holds, as required. □ 
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4.2. Atoms in the spectral distribution. 

Theorem 4.5. Let {'Tn)^^i a sequence of random finite unlabeled rooted trees that 
converges in the probability fringe sense to a sin-tree. For any 7 G M, Fn{{'j}) 
converges in distribution to a (model dependent) constant c-y as n 00. If F is the 
limiting spectral distribution guaranteed by Theorem \4.1\ then i^({7}) > c^. 

Theorem 4.6. Suppose that ("T^i)^! is a sequence of random finite unlabeled rooted 
trees that converges in the probability fringe sense to a sin-tree T = (T'^,T^, . . .). 
Consider a forest u composed of finitely many finite unlabeled rooted trees, and as- 
sume that some eigenvalue 7 of the adjacency matrix of u has multiplicity L > 1. 
Write lA for the random forest obtained by deleting the root ofT^ from , and sup- 
pose that ^{U = u} > 0. Then, the constant c-y of Theorem \4.5\ is strictly positive 
and hence 7 is an atom of the limiting spectral distribution F of Theorem \4-l\ 

Remark 4.7. The condition that the adjacency matrix of the forest u has an eigen- 
value 7 with multiphcity greater than 1 certainly holds if 7 is an eigenvalue of more 
than one of the trees that make up u. In particular, it holds if two or more of the 
trees that make up u are equal to some common tree t, and 7 is an eigenvalue of 
t. It is clear for the random recursive tree, the linear preferential attachment tree, 
and the uniform random tree, that if u is any forest of finite unlabeled rooted trees, 
then F{U = u} > 0, and so any number 7 that is the eigenvalue of the adjacency 
matrix of a finite tree will be an atom of the limiting spectral distribution F for 
these models. From Theorem 7 of |RR07aj . the eigenvalues of the adjacency matrix 
of the rooted tree in which every non-leaf vertex has out-degree d and each leaf is 
distance fc — 1 from the root are 

2Vdcos^-^^, j = l,...,k, e^l,...,j, 

with given multiplicities. It follows that the atoms of the limiting spectral dis- 
tribution F are dense in R for the random recursive tree, the linear preferential 
attachment tree, and the uniform random tree. A similar argument shows that the 
limiting spectral distribution for the random binary tree has a set of atoms that is 
dense in the interval [— 2-\/2, 2-\/2]. Because we can embed any binary tree into a 
complete binary tree of suitable height, we see that the limiting spectral measure 
has this interval as its support. 

Remark 4.8. In light of the previous remark, it is natural to inquire whether the 
limiting spectral distribution F is purely discrete or whether it also has a continuous 
component. Our methods do not suffice to resolve this question. 

4.3. The proportion of zero eigenvalues and maximal matchings. Theo- 
rem 14.51 and Theorem 14.61 show that the limiting spectral distribution F will typ- 
ically have many atoms. However, Theorem 14.61 provides a rather crude lower 
bounds on the mass of each atom. We obtain better lower bounds on the limiting 
proportion of zero eigenvalues in Subsection l6.3l The key tool we use is the intimate 
connection we recall in Subsection 16.11 between the number of zero eigenvalues of 
the adjacency matrix of a tree and maximal matchings on the tree - a notion that 
we now review briefly. 

Suppose that G is a graph with vertex set V and edge set E and for each edge 
e £ E there is a corresponding weight w{e). Recall that a matching of G is a subset 
oi S Q E such that no two edges in S share a common vertex. A matching S* 
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is maximal for the system of weights {w{e) : e G E} if X^ess* 'w{e) > X^ess "^i^) 
for any other matching S. There may be several maximal matchings but the total 
weight Xees* '^(s) is, of course, the same for all of them. When no weights are 
mentioned explicitly, they are assumed to be all 1, and the total weight of a maximal 
matching in this case is just the maximal possible cardinality of a matching. 

Although we only need the case when all the weights are 1 to investigate the 
proportion of zero eigenvalues, our methods establish the following more general 
result without much further effort. 

Theorem 4.9. Consider a sequence (7^)5^]^ of random trees that converge in the 
probability fringe sense to a random sin-tree T — [T'^ , . . .). Write Mn for the 
number of vertices ofTn and M{Tn) for the total weight of a maximal matching on 
Tn when the associated system of edge weights is a collection of independent and 
identically distributed Rj^^ -valued random variables X„(e) with a common distribu- 
tion V that has finite expected value. Then, M~^M{%i) converges in distribution 
to a (model dependent) constant cr.v as n —> oo. 

Using their objective method, Aldous and Steele [AS04| show that M~'^¥.[M{Tn)\ 
converges in the case of the ensemble of uniform random trees. Moreover, they 
characterize the limit in terms of the fixed points of certain distributional identities. 

4.4. Isospectrality. A result of Schwenk [Sch73| states that the probability the 
adjacency matrix of a realization of the uniform random tree has the same spectrum 
as some other (non- isomorphic) tree converges to one as the number of vertices goes 
to infinity. Schwenk's method was developed further in [BM93J. The key idea is 
to first establish that a certain pair of non-isomorphic finite rooted trees ti and 
t2 with the same number of vertices have the following exchange property: If t' is 
any finite rooted tree with ti as a subtree, then replacing ti by t2 produces a tree 
t" with the same adjacency matrix spectrum as that of t'. If one can then show 
that a given sequence is such that P{ti is a subtree of 7^} ^ 1 as n ^ oo, 
then P{7^ shares its spectrum with another tree} — > 1 as n ^ oo. Pairs of trees 
with exchange property are exhibited in [Sch73| IBM93| . Pairs of binary trees (that 
is, every non-leaf vertex has out-degree 2) with the exchange property are found 
in |ME06j . The following result is sufficiently obvious that we will not provide a 
proof. It applies to all four of the models in Section [51 with the pair ti,t2 being, 
for example, the binary trees in jME06| . 

Proposition 4.10. Consider a sequence {Tn)'^=i of random finite unlabeled rooted 
trees that converges in the probability fringe sense to a sin-tree T = {T^ , . . .). 
Suppose for some pair ti,t2 G T with the exchange property that PjT'^ = ti} > 0. 
Then, 

lim P{7^ shares its spectrum with another tree} — 1. 

n — *oo 

4.5. Largest eigenvalues and largest degrees. The following result is proved 
in |FFF05|.lFFF03ilCLV03"a] in the case a = 0. The proof extends readily to general 
a > -1. 

Theorem 4.11. Let {'T„)^^i be the ensemble of linear preferential attachment trees. 
Fix any k > 1. Write A„_i > A„,2 > . . . > Xn,k for the k largest eigenvalues of 
the adjacency matrix of Tn and denote by A„_i > A„^2 > • ■ • > ^n.k the k largest 
out-degrees of%i. Then, \n.i/ \/ ^n,i converges in distribution to 1 as n ^ oo for 
l<i<k. 
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We complement this result by establishing the following theorem. Recall that 
the linear preferential attachment model depends on a parameter a > — 1. Define 
the corresponding Malthusian parameter by 

(4.2) 7,:== a + 2. 

Theorem 4.12. There exist random variables Xi > X2 > ■ ■ ■ Xk > that such 
that 

Vnl/7a'„l/7a'---'„l/7aJ 

converges in distribution to {Xi, X2, ■ ■ ■ , Xk) as n — > 00. Hence, 
converges in distribution to \fX2^ . . . , \fXk) as n 00. 



5. Convergence of spectral distributions 

5.1. Interlacing inequalities and some of their consequences. Suppose that 
A is an TO X m Hcrmitian matrix and i? is an 7i x n principal sub-matrix of A for 
1 < n < m (that is, i? is formed by deleting m — n rows and columns of A with the 
same indices). 

Write ^1 < ... < for the eigenvalues of A and vi < ... < f„ for the 
eigenvalues of B. The interlacing theorem (see, for example, |HJ90j ) gives that 

tl'k<Vk< fj.k+m-n ioT 1 < k < n. 

Write P := ^ '^^i fo'" spectral distribution of A and Q '■— ^ X^ILi ^t^i 
for the spectral distribution of B. 

We wish to compare P and Q. To this end, we recall that the Levy distance 
between two probability measures a and r on M is given by 

d(cr, t) inf{e > : S{x - e) - e < T{x) < S{x + e) + s, Vx e M}, 

where S and T are the cumulative distribution functions of cr and r, respectively 
- see, for example, [ZolOl] . The Levy distance is a metric that metrizes weak 
convergence of probability measures on K, and the space of probability measures 
on K is complete with respect to this metric. Note also that if c ^ t, where ^ 
denotes stochastic domination (that is, S(x) > T{x) for all x e M), and a' and 
r' are two other probability measures with a < a' ^ t and cr ^ r' ^ r, then 
d{a',T') < d{a,T). Finally, observe that if X and Y are two random variables on 
the same probability space that have marginal distributions cr and t, respectively, 
then d{a,T) < V{X ^ Y}. 

We return to the comparison of P and Q. Define two new probability measures 
by n = i ELi and P* = ^ ELi ^^^.+^-,^■ It is clear that P, < P < P* and, 
by the interlacing theorem, that P^, < Q < P* . Thus, 

d{P,Q) <d{P,,P*). 

Suppose that on some probability space we have a random variable / that is uni- 
formly distributed on {1, . . . , n}. Define a pair of random variables {Z^, Z*) by 

{Z z*) - l^^^'^'^' on the event {/ > TO - n + 1}, 

' 1 {ill, l^n+i), on the event {/ < to — n}. 
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Note that has marginal distribution and Z* has marginal distribution P* . 
By construction, P{Z, 7^ Z*} < [(m — n) A 71] /n. We have therefore established the 
following result. 

Proposition 5.1. In the above notation, d{P,Q) < — 1) A 1. 

Corollary 5.2. Consider a sequence (^fe)^x of Hermitian matrices, with {Ak)'^i 
being ni^ x mk and having spectral distribution P^. For each e > 0, let (Bf.)^^ be 
such that B'f, is an n| x n| principal sub-matrix of Ak with spectral distribution Q\. 
Suppose for every e > that — limfc^oo Q| exists and limsupj._joo ruk/nj, < 
1 + e. Then, Poo — lim^j^oo Pk exists and is given by Poo — linie^o Q5o- 



Proof. From Proposition l5.ll 

hm sup d{Pk ,Pi) < lim sup d{Pk , Ql) 

k,£ — >-oo fc— *oo 

+ limsup(i((3|, Ql) + limsup(i(Q|, P^) 
< 2e 

for each £ > 0. The sequence {Pk)kLi is thus Cauchy in the Levy metric, and hence 
it converges weakly to a limit Poo • 
Moreover, 

rf(Poo,QL)= lim diPk,Ql)<e, 

k — >oo 

and so Poo = linieio Qlo- ^ 
Proposition 5.3. In the notation at the beginning of this subsection, 
|#{1 < fc < m : = 7} - #{1 < fc < n : i/fe = 7}| 
= |mP({7})-nQ({7})| 

< (m — n) 

for all 7 e M. 

Proof. Suppose that p = #{1 < k < m : ^k = 7}, with /ia+i = • ■ ■ t^-a+p = 7, 
and q = ^{1 < k < n : Vk = 7}, with Vh+i = . ■ . i^b+q = 7- It follows from the 
interlacing inequalities that Va+i < fJ-a+i, provided a + 1 < n, and Va+p-(m-n) < 
^Jia+p provided a + p — {m — n) > 1. Hence, q > p — [m — n). Similarly, v^^i < 
M6+i+(m-n) and p.b+q < n+q, SO that p> q- {m - n). Thus, \p - q\ < {ni - n), as 
required. □ 

Corollary 5.4. Consider a sequence {Ak)'^i of Hermitian matrices, with (Ak)'^-^ 
being nik x ruk and having spectral distribution Pk. For each £ > 0, let (P|)^]^ 
be such that P| is an n| x n| principal sub-matrix of Ak with spectral distribution 
Q|. Suppose for some fixed 7 S M that for every e > the limit lim^^oo Qki{"/}) 
exists anrf lim supj._,g^ mj; < 1 + e. Then, limfe_>oo ^^({7}) exists and is given 
by limeio linifc^oo QUil})- 



Proof. From Proposition [5?3l 

|m,P,({7}) - nlQU{j})\ < {mk - 4), 

and so 



mil}) - Q%{{i})\ <(i-^) + (^-i) 

\ mkj \nl J 
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An argument using completeness similar to that in the proof of CoroUarv IS . 2l finishes 
the proof. □ 

Corollary 5.5. Consider a forest u made up of finitely many finite unlabeled rooted 
trees, and assume that some eigenvalue 7 of the adjacency matrix of u has multi- 
plicity L. Suppose that A is the adjacency matrix of a finite unlabeled rooted tree t 
with m vertices, and suppose that there are K vertices v of t such that the forest 
formed by deleting v from the subtree below v produces the forest u. Then, 7 is an 
eigenvalue of the matrix A with multiplicity at least KL — m-\-{m — K) = K{L—1). 



Proof. The proof follows immediately by applying Proposition 15 . 31 to the matrix B 
that is the adjacency matrix of the graph obtained by deleting the K designated 
vertices from t. The matrix B is block diagonal, and some of its blocks can be 
collected into K identical larger blocks that each form a copy of the adjacency 
matrix of the forest u. It remains to observe that the set of eigenvalues of a block 
diagonal matrix is the union (including multiplicities) of the sets of eigenvalues of 
the respective blocks. □ 

5.2. Proof of Theorem 14.11 Suppose that the random tree %i has M„ vertices 
and adjacency matrix A„. 

Fix a positive integer K. The construction of several objects in the proof will 
depend on K, but our notation will not record this. 

Denote by Wn the set of vertices v oiTn such that the sub-tree below v (including 
v) contains at most K vertices. Put iV„ := ffWn- In the notation of Section [3l 
7V„/M„ = $(r„, {(to,ti, . . .) : #to < K}). 

In order to avoid conflicting notation, write the limit sin-tree T as {T^,T^, . . .). 
By the assumption of probability fringe convergence, iV„/M„ converges in distri- 
bution to the constant P{#T° < K}. The latter constant can be made arbitrarily 
close to 1 by choosing K sufficiently large. 

Denote by Un the subgraph of %i induced by the set of vertices Wn- That is, 
the graph Un has vertex set Wn and two vertices in Un are connected by an edge if 
they are connected by an edge in Tn- The graph Un is a forest. 

Write Xnk, 1 < k < K, for the set of vertices v of with the following two 
properties: 

• the subtree below v contains k vertices, 

• if w is first vertex (other than v) on the path to the root from v, then w 
is on the path to the root for more than K vertices (that is, the subtree 
below w contains more than K vertices). 

The set of roots of the trees in the the forest Un is the disjoint union [j'l^^i Xnk- 
Put Rnk '-= 4f^nk, so that Nn = X^fcLi ^^nk- It foUows from the assumption of 
probability fringe convergence that Rnk/Mn = $(7^, {(io, ^i, • ■ •) : #io = k, ffto + 
#ii > K}) converges in distribution to the constant pk ■= P{#T'' = k, #T" + 
> K}. Of course, the value of pk depends on K and may be 0. However, 

k=l fc=l 
Nn 

= lim — — 



= P{i^T" < K}. 
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Moreover, if we write 

:= ^^{Tn, ■ n {{toM. . . .) : #io = fc, #io + #ii > K}) 

for the empirical distribution of the subtrees rooted at the vertices in Xnk (with 
some suitable convention when Rnk = 0) , then S„fc is concentrated on the finite set 
of trees with k vertices and S„fc({t}) converges in distribution when pfc > to the 
constant 

Sfe({t}) := p{ro = 1 1 #r" = fc, #r" + > k} 

for each such tree. 

Denote by Aj. the distribution of an eigenvalue picked independently and uni- 
formly at random from the k eigenvalues (counting possible multiplicities) of the 
k X k adjacency matrix of a fc- vertex random tree with distribution S^. The prob- 
ability measure is concentrated on the finite set of real numbers that are the 
possible eigenvalues of some tree with k vertices. 

Write Bn for the adjacency matrix of the forest Un- This is a block diagonal 
matrix with Rnk many fc x fc blocks for 1 < k < K. Recall that the set of eigen- 
values of a block diagonal Hermitian matrix is the union of the eigenvalues of the 
blocks (including multiplicities). Thus, the spectral distribution of Bn converges in 
distribution to the deterministic probability measure 

EfcLi kpk 

as 71 — !■ oo. 

An application of CoroUarv 15.21 completes the proof. 

Remark 5.6. It is instructive to consider what the various objects that appeared in 
the proof look like in a simple example. Suppose that Tn is the deterministic tree 
with 2"+^ — 1 vertices in which every non-leaf vertex has out-degree 2 and each leaf 
is distance n from the root. We say that 7^ is a complete binary tree of height n. 
It is clear that Tn converges in the probability fringe sense to a random sin-tree 
{T'^,T^, . . .), where T° is a complete binary tree of height H with ¥{H = h} = 2~'^, 
h = 0,1, . . ., and T* consists of a root connected by an edge to the root of a complete 
binary tree of height H + i ~ 1 for i > 1 . 

If 2"+i - 1 > A' and ^ is the unique integer such that 2^+^ -1<K< 2^+^ - 1, 
then Wn is the set of vertices of Tn that are within distance at most £ of the leaves. 
Thus, Nn = 2"-^(2^+i - 1). Moreover, the set Xnk is empty unless fc = 2^+\ in 
which case Xnk is the set of vertices of Ti that are at distance exactly £ from the 
leaves and Rnk = 2""^. 

The sub-probability distribution {pk)k=i assigns mass 2~^ to 2^+^ — 1 and 
elsewhere, while the probability measure is the point mass at the complete 
binary tree of height h when fc is of the form 2''+^ — 1. The spectral distribution of 
Bn converges to the spectral distribution of the complete binary tree of height £. 

5.3. Proof of Theorem 14.51 The proof is almost identical to that of Theo- 
rem l4.1l in Subsection l5.2l Recall from that proof the constant K, the probabilities 
Pi, . . . ,pk, the probability distributions Xk, I < k < K, on R, and the random 
adjacency matrix Bn with distribution depending on K and n. Recall also that the 
probability measure Xk is concentrated on the finite set of real numbers that are 
the possible eigenvalues of some tree with fc vertices. 
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It follows from the argument in Subsection 15.21 that the mass assigned by the 
spectral distribution of Bn to 7 G M converges in distribution to the deterministic 
probability measure 

kpk 

as n — !■ 00. 

An application of Corollarv l5.4l completes the proof. 

5.4. Proof of Theorem I4.6i It follows from Corollary 15.51 that multiplicity of 7 
as an eigenvalue of the adjacency matrix of 7n is at least (L — 1) times the number 
of vertices v oiT„ such that the forest formed by deleting v from the subtree below 
V produces the forest u. By the assumption of probability fringe convergence, the 
proportion of eigenvalues of the adjacency matrix of 7^ that have the value 7 (that 
is, i^ri({7})) satisfies 

nPniil}) >{L- 1)¥{U = u} - e} ^ 1 

as n — > 00 for any e > 0. Moreover, because Fn converges weakly to F in distribu- 
tion by Theorem 14. II 

P{F({7})>F„({7})-e}^l 
as n — !■ 00 for any e > 0. Combining these observations establishes that 

F{M)>{L-l)¥{U = u}>0, 

as required. 

6. Maximal matchings and the number of zero eigenvalues 

6.1. Combinatorial preliminaries. The following lemma is standard, but we 
include the proof for completeness. 

Lemma 6.1. Consider a tree t with n vertices and adjacency matrix A. Let 5{t) 
denote the number of zero eigenvalues A. Then 

5{t) =n-2M{t), 

where M{t) is the cardinality of a maximal matching oft. 

Proof. It follows from the usual expansion of the determinant that the characteristic 
polynomial of the adjacency matrix of t is given by 

L«/2J 
k=0 

where Nh{t) is the number of matchings of t that contain k edges (see, for example. 
Addition Result 7b of |Big93| ), and the result follows immediately. □ 

Our analysis of the cardinality of a maximal matching for a tree relies on the 
following "greedy" algorithm for producing a maximal matching of a forest. It 
is a simplification of one due to Karp and Sipser |KS81| that is intended to find 
approximate maximal matchings of more general sparse graphs. The algorithm 
takes an initial forest and iteratively produces forests with the same set of vertices 
but smaller sets of edges while at the same time adding edges to a matching of the 
initial forest. We stress that a leaf of a forest is a vertex with degree one. 
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• Input a forest f with vertices V{{) and edges E{i). 

• Initialize S* <— 0. 

• While E{f)y^lD do 

* Choose a leaf, say x, and let {x, y} be the unique edge in f incident to 

X. 

* Set £;(f ) ^ {e G E{t) :en{x,y} = 0}, andS^SU {{x, y}} . 

• Output the matching S. 

Lemma 6.2. The algorithm produces a maximal matching as its output. 

Proof. Let x be any leaf of the forest, and write {a;, y} for the unique incident edge. 
Note that every maximal matching either contains the edge {x, y} or an edge of the 
form {y, z} for some vertex z ^ x, because otherwise {x, y} could be added to a 
putative maximal matching that contains no edge of the form {y, w} to produce a 
matching with a larger cardinality. Also, note that replacing any edge of the form 
{y, z} with z ^ X that appears in some matching by the edge {x, y} results in a 
collection of edges that is also a matching and has the same cardinality. It follows 
that the edge {x, y} must belong to at least one maximal matching. 

The result now follows by induction on the number of edges in the forest. □ 

Note that we are free to take any current leaf at each iteration of the "while" 
step of the algorithm. We start with some initial set of leaves and each iteration of 
the while step removes some leaves (by turning them into isolated vertices) as well 
as sometimes producing new leaves. We can therefore think of the leaves present 
after the completion of each while step as being labeled with the number of the 
step at which that vertex became a leaf, where the leaves in the initial forest are 
labeled with 0. We adopt the convention that in any iteration of the while step we 
take one of the current leaves with the lowest label. 

Put zo = and define ii,i2, ■ • ■ inductively by setting ik+i to be the number of 
iterations of the while step required until all of the leaves with labels at most ik 
are turned into isolated vertices, where ik+i = ik if the forest after ik iterations 
already consists of only isolated vertices. The numbers ik are eventually constant 
and this final value is the cardinality of a maximal matching. 

Each of the iterations + 1, . . . , ik+i of the while step is one or the other of the 
following two types. 

Type I: The iteration removes all of the edges of the form {y, z}, where the vertex 
y is not a leaf with label at most ik and there is a leaf x with label at most 
ik such that {y, x} is an edge (so that y is at graph distance 1 from the 
leaves of the forest present after ik iterations). 
Type II: The iteration removes an edge of the form {y, z} such that y and z are both 
leaves with label at most ik (we say that {y, z} is an isolated edge in the 
forest present after ik iterations). 

Therefore, the cardinality of a maximal matching is the number of vertices that 
will be at graph distance 1 from the current leaves after ik iterations of the while 
step for some k plus the number of edges in the initial forest that will eventually 
become isolated edges after ik iterations of the while step for some k. We next 
introduce some notation to describe the sets of vertices and edges we have just 
characterized. 

Write ik, Ek{i), Lk{i), and /fe(f), respectively, for the forest, the set of edges, 
the set of leaves, and the set of isolated vertices after ik iterations of the while 
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step starting from the initial forest f. Note that Ek{i) is obtained by removing all 
edges {y,z} e Ek-i{i) such that there exists x G Lfc_i(f) with {x,y} e Ek-i{i)- 
Equivalently, Ek{i) consists of exactly those edges {u,v} e £';;_i(f) such that both 
vertices u and v are at graph distance at least 2 from Lk-i{{) in tk-i- This means 
that vertices that are distance or 1 from Lfc_i(f) in ik~i are isolated in f^, and 
vertices that are at graph distance 2 or greater from Lk-i{t) have degree in equal 
to the number of their neighbors in fk^i that are at graph distance 2 or greater 
from ife_i(f). 

We further introduce new sets Gk{i), Hk{i) and Jfe(f) as follows: 
Gfc(f) {u e ifc(f) : 3v e ifc(f) so that {u, v} G Ekit)}, 

Hk{i) := {u e V{i) \ ifc(f) : 3v e Lfe(f) so that {u, v} G Ek{f)}, 
Jfc(f) := (4.+i(f)\4(f))\(Gfc(f)UiJfe(f)). 

In words, Gfc(f ) is the set of leaves that are one of the two leaves of an isolated edge 
present after ik iterations of the while step - these are the vertices that become 
isolated during iterations ifc + 1, . . . ,ik+i due to Type II steps, i?fe(f) is the set of 
vertices that are graph distance 1 from the leaves after iterations of the while 
step - these are the non-leaf vertices that become isolated during iterations ik + 
due to Type I steps, and Jfc(f) is the remaining set of vertices that 
become isolated during iterations it + 1, ... , ik+i (all due to Type I steps). Note 
that V{i) is the disjoint union of /o(f) and Gfc(f), i?fc(f), Jfe(f), k > 0, and so 

oo 

#y(f) = #/o(f) + (#Gfc(f) + #i/fe(f) + #jfe(f)) . 

fe=0 

Clearly, all the above objects can also be defined for an infinite forest f such that 
every vertex is at a finite graph distance from a leaf, (that is, a vertex of degree 
one). 

The discussion above leads immediately to the following result. 
Lemma 6.3. The cardinality of a maximal matching of a finite forest f is 

M{i) = J2*Hk{i) + -J2#Gk{i). 

k=0 k=0 

Consequently, the number of zero eigenvalues of the adjacency matrix of a finite 
tree t is 

oo oo 

S{t) - W{t) - 2M{t) = ^#Jfe(t) - ^#i7,(t). 

A;=0 fc=0 

Example 6.4. Consider the tree t with vertices {!,..., m} and edges connecting 
successive integers. The cardinality of a maximal matching is obviously (m — l)/2 
when TO is odd and to/2 when is even (so that d{t) is 1 when to is odd and when 
is even). There are four cases to consider in checking that this agrees with the 
formula of Lemma 16.31 

Case I: to is odd and (to — l)/2 is odd (<;=> to = 3 mod 4). 

Then, Hoit) = {2, to - 1}, ffi(t) = {4, to - 3}, . . ., 7?(„,_3)/4(t) = {(to + l)/2}, 
all other Hk{t) are empty, and all Gfe(t) are empty. The formula of Lemma [6.31 
gives 2 X (to - 3)/4+ 1 = (to - l)/2. 

Case II: to is odd and (to — l)/2 is even (<^ to = 1 mod 4). 
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Then, Ho{t) = {2,m-l}, i/i(t) - {4,m-3}, . . ., i/(™_5)/4(t) = {(m - 

1) /2, (m + 3)/2}, all other Hk(t) are empty, and all Gfe(t) are empty. The formula 
of Lemma [Ql gives 2 x ((m - 5)/4 + 1) = (m - l)/2. 

Case III: m is even and (m — 2)/2 is odd (<S=> m = mod 4). 

Then, Hoit) = {2,m- 1}, Hi{t) = {4,m- 3}, . . ., iJ(,„_4)/4(t) = {m/2, (m + 

2) /2}, all other Hk{t) are empty, and all Gk{t) are empty. The formula of 
Lemma 1631 gives 2 x ((to — 4)/4 + 1) = to/2. 

Case IV: m is even and (m — 2)/2 is even (<^ to = 2 mod 4) 

Then, Ho{t) = {2,to-1}, iJi(t) = {4,to-3}, . . ., i/(™_6)/4(t) = {(m - 
2)/2, (m + 4)/2}, aU other Hk{t) are empty, G(m_2)/4(t) = {(to/2, (to + 2)/2}, and 
all other G'fc(t) are empty. The formula of Lemma [6?3l gives 2x(fm — 6)/4+l) + l = 
to/2. 

6.2. MsLximal weighted matchings: Proof of Theorem 14. 9i We will use the 
same construction as we used in the proof of Theorem 14. II in Subsection 15.21 

Recall that for a fixed positive integer K this construction produced for each n 
a set of vertices Wn of Tn with cardinality Nn such that A^„/Af„, where M„ is the 
number of vertices of %i, converged in distribution to P{#T° < K} - a constant 
that can be made arbitrarily close to 1 by choosing K sufficiently large. 

The subgraph of Tn induced by Wn was the forest Un rooted at the points 
IJ^-i^ Xnk and #X„fe/M„ converged in distribution to the constant pk := P{#T° = 

fc, + #ri > K}. 

Moreover, the random probability measure S„fc given by the empirical distribu- 
tion of the subtrees rooted at the vertices in X„k was concentrated on the finite set 
of trees with k vertices and S„fc({t}) converged in distribution when > to the 
constant 

Sfc({t}) := p{ro = 1 1 #r" = k, i^T'' + #ri > k} 

for each such tree t. 

Write M{Tn) (respectively, M{Un) for the total weight of a maximal matching 
on %i (respectively, Lin) for the independent, identically distributed edge weights 
X„(e), where e ranges over the edges of 7^. 

Note that a maximal matching on lAn is obtained by separately constructing 
maximal matchings on each component subtree of Lin- It follows from Lemma 16.51 
below that M~^M{Un) converges in distribution to 

K 

k=l t:#t=/c 

where ^(t) is the expected value of the total weight of a maximal matching on 
t when the weights of the edges are independent and identically distributed with 
common distribution v. 

Observe that any matching on lAn is also a matching on Tn and that the restriction 
of any matching on 7^ to Un is a matching on Un- Thus, 

M{Un) < M{Tn) < M(Un) + 

where E{Tn) (respectively, EiUn)) is the set of edges of 7^ (respectively, Un)- 

There is an element of E{Tn)\E{Un) for each vertex of Tn other than the root 
that is not a vertex of Un and one for each root of a subtree in the forest Un- Thus, 
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writing /i for the common expected value of the edge weights, 

K 



E 



E 

e£E{r„)\E{U„) 



(M„-7V„-l)++^#X„fc 





Note from above that 1 — M^'^Nn converges in distribution to the constant 
P{#T° > K} and M~^^^^^4I^Xnk converges in distribution to the constant 
Y.k=iPk = < K, i^T^ + ifT^ > X} as n -> oo. Both of these constants 

converge to as X ^ cx). It foUows that 



hm hm P <^ M-^ V X„(e) > e i = 
ee£;(r„)\£;(w„) J 

for aU £ > 0. 

Therefore, M^^M{%i) converges in distribution as n ^ oo to the constant 

K 

hm Y.P^ 2fc({t})/i(t), 

A— *oo ^ ^ ^ ^ 

k=l t:#t=k 

where we stress that pk and depend on K, even though this is not indicated by 
our notation. 

The foUowing lemma, which we used above, is a straightforward consequence of 
the strong law of large numbers. 

Lemma 6.5. For i — 1,2, .. . let U he a positive integer-valued random variable 
and . . . , he random variables taking values in a finite set Q. Suppose that as 
i oo the random variable U converges in distribution to oo and for each 6 Cz Q 
the random variable 

#{1 < J < : = 9} 
L' 

converges in distribution to a constant n(0). Let Cii ■ ■ ■ iCl^ R^-valued random 
variables that are conditionally independent given 6\, . . . , O^j^^, and such that 

nCjeA\e\,...,6i^}^n{e]-A) 

for some collection of Borel probability measures (11(0; •))ege- Suppose that 

v{e) / xn{e;dx) < oo 

for all Q. Then, 

E.7 = l 



converges in distribution to 



TT{9)v{e) 



oo. 



Remark 6.6. Let a sequence (7^)^i of random unlabeled rooted trees converge 
in the probability fringe sense to a random sin-tree T. Let M„ be the number of 
vertices in 7^. Consider the case when each edge- weight is identically one. Write 
I{Tn) for the cardinality of a maximal independent set for 7^. By Konig's theorem 
[BM76| , for a general bipartite graph the cardinality of a maximal matching is equal 
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to the cardinality of a minimal vertex cover. On the other hand, complementation 
of a minimal vertex cover in any graph always yields a maximal independent set. 
Thus, I{Tn) = Mn — M{Tn) in our case. Consequently, M~^I{%i) also converges 
in distribution to a (model-dependent) constant nq- >l/2asn— s-cx). 

6.3. Asymptotics of the number of zero eigenvalues. If we combine The- 
orem 14.91 on the rescaled convergence of the total weight of a maximal weighted 
matching with Lemma 16.11 on the connection between the cardinality of a maximal 
matching and the number of zero eigenvalues of the adjacency matrix, then we get 
another proof of Theorem 14.51 on the convergence of fn({7}) in the special case 
when 7 = 0. We now improve this result by using Lemma l6.3l to give a formula for 
the limit in terms of features of the limit sin-tree. We then show that how this 
formula may be used to get explicit lower bounds on the limit. 

Proposition 6.7. Consider a sequence (7^)5^1 of random unlabeled rooted trees, 
where %i has Af„ vertices. Suppose that {Tn)'^=i converges in the probability fringe 
sense to a random sin-tree T = {T^ , , . . .) and write R for the root ofT'^. Then 
i^„({0}) converges in distribution as n oo to 

oo 

5](p{i?e Jfe(r)}-P{i?ei?fc(r)}). 

fe=0 

Proof. In view of Theorem 14.51 its enough to prove the convergence of F„({0}) to 
the desired quantity in expectation. If y is a vertex chosen uniformly at random 
from %i, then, by Lemma [6.31 we can write E[F„({0})] = M^^E[S{Tn)] as 

oo 

(6.1) 

= J2 i^i"^ e JkiTn)} - P{V e HkiTn)}) . 

Given a tree t e T with root p and a vertex v G t, write A4 {v, t) for the subtree of 
t induced by vertices that are at graph distance at most k from v. Note that whether 
or not a vertex w of t belongs to the sets Hk{t) or Jfe(t) can be determined by exam- 
ining the neighborhood M2k+A{v, t). Observe also that (io, ^i, i/t, *,*,■• ■) G 
is the decomposition of t relative to p and then Mk{v,t) can be reconstructed 
from (to,ii, ■ • • ,tfcAh)- 

Recall that Jk{%i) and Hk{Tn) are both subsets of Ik+i{%i) \ Ik{%i), and so 

\¥{V e Jfc(7;)} - P{V e iffc(T„)}| < P{V e 4+i(r„) \ 4(r„)}. 

Moreover, for any nonnegative integer m, 

CXj 

^p{ye4.+i(r„)\4(7;o} 

k—m 

= p{T/e F(r„)\/™(r„)} 

< P{the subtree of %i below V contains at least 2m — 1 vertices } 

^ P{#r° > 2to- 1} 

as n ^ oo, by the assumption of probability fringe convergence. The last term 
clearly converges to as m — s- oo. A similar argument shows that the analogous 
series involving the limiting sin-tree is also absolutely convergent. 



24 



SHANKAR BHAMIDI, STEVEN N. EVANS, AND ARNAB SEN 



Finally, it follows from the assumption of probability fringe convergence and our 
observations above about membership of Hk{T) and Jk{T) being locally determined 
that for each fc > 1, the first k terms of the series (j6.ip converge to the corresponding 
terms of the desired infinite series involving the limiting sin-tree. □ 

By construction, for any tree t, Gfc(t) C Lfc(t) and Lk{t) \ Gfc(t) C Jfc(t). 
Set Kkit) := Jfe(t) \ {Lk{t) \ Gfc(t)). That is, Kk{t) consists of vertices that 
become isolated due to Type I steps during iterations «fc + 1, . . . , ik+i of the Karp- 
Sipser algorithm but are not leaves in the forest present after iteration i^; for 
example, if t has vertices {1,2,3,4,5} and adjacent integers are joined by edges, 
then Jo(t) = {1,3,5}, Lo{t) \ Go(t) = Lo{t) = {1,5}, and Ko{t) ^ {3}. Note 
that for each v G -fffc(t) there exists u G Lk{t) \ Gfc(t) such that {v,u} e Ek{t). 
Also, if vi,V2 are distinct elements of Hk{t) and wi,U2 G ife(t) \ Gfc(t) are such 
that {wi, til}, {w2, "2} € Ek{t), then ui and U2 are also distinct. Consequently, 
#Lfc(t) — #Gfc(t) — ^Hk{t) > 0. Applying this observation to 7r„, dividing by A/„, 
and taking the limit as n — > 00, we deduce that the formula in Proposition IG . 7l for the 
limit of Fn{{0}) may be written as a sum over k of the sum of the two nonnegative 
terms P{R e ifc(T)} - P{R G Gfc(r)} - P{R g iJfc(r)} and F{R e KkiT)}. We 
may give good lower bounds for the first few of these summands with relative ease. 

We first find lower bound on F{R e Lo{T)} ~ ¥{R e Go{T)} - P{R e Ho{T)}. 
Note for any tree t with 3 or more vertices that Go(t) = 0. Observe also that 

#Lo(T„) - #i/o(T„) 

00 

= (to — 1) X #{ii e Ho{Tn) ■ u is connected to exactly to vertices in Lo(^)} 

m=2 
00 

— 5Z ("^ ^ 1) ^ #{w G y{%i) ■ the subtree below u is an TO-star}, 

m=2 

where by a TO-star we mean a unlabeled rooted tree with (to -1- 1) vertices in which 
the root is connected to each of the other to vertices via an edge. Therefore, 

¥{R e Lo(r)} - P{R e Ho{T)} = lim M-\E[#Lo{Tn)] - E[#ffo(T„)]) 

00 

- ^i"^^^) ^ ^{'^^ is an m-star}. 

m=2 

On the other hand, it is easy to check that P{i? e -ftro(^)} > E T", e T'}, 

where T' C T is the set of finite unlabeled rooted trees for which the root has at 
least one child that has no children, and T" C T is the set of finite unlabeled rooted 
trees for which the root has single child and that child in turn has at least one child 
that has no children. 

As one might expect, finding good lower bounds on the terms P{R G Lk{T)} — 
P{R G Gfc(T)} - V{R G Hk{T)} and P{i? G Kk{T)} becomes increasingly difficult 
as k gets larger. However, we can still get crude lower bounds by computing the 
probability of appearance of special kinds of trees in the first few fringes in the 
limiting sin-tree. For example, 

¥{R G Lk{T)} - F{R G Gk{T)} - F{R G Hk{T)} 
> P{T" is a complete binary tree of depth {2k + 1) 
and = • for 1 < i < 2fc - 2}. 
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where • denotes the rooted tree with a single vertex. The proof fohows along the 
same lines as the k = case above. Furthermore, 

V{ReKk{T)} 

> P{r° is a path of length {2k + 2) , 

= • for 1 < i < 2k, 

and T^'^+i is a 1-star}. 

For the ensemble of linear preferential attachment trees with parameter a = 0, it 
is well known (see, for example, [DurO? ) that the proportion of vertices with degree 
d converges in distribution to pd — 4:/d{d + l)(d + 2). Specializing to d = 1, we see 
that n^^^Lo{Tn) converges in distribution to 2/3, and so F{R E Lo{T)} = 2/3. 
Hence, 

lim F„({0}) > ¥{R e LoiT)} - F{R e Ho{T)} > 2F{R e Lo(r)} - 1 = 1/3. 

n — ^oo 

Now consider the ensemble of random recursive trees. Recall Construction l3 . 7{ a) . 
Let CiiCii* ^ 1 ^i^d X be i.i.d. exponential random variables with rate 1. To get 
a lower bound on lim„^oo ^n({0}), we may use the inequality lim„_>oo -Pri({0}) > 
Sm=2('^ — 1) X P{T° is an m-star} where 

^ m m+1 ^ m 

Y.^^<x,J2i.>x\x\l[p{^i>x\x} 

< 2—1 i—l ) i—1 

ni m+1 



P{r" is an m-star} > E 



E 



E 



1=1 



X 



-mX 



-mX 



m! 



(m-K2)-(™+i). 



For the uniform random trees, we can easily obtain lower bounds for various 
terms using the description of the fringes of the limiting sin-tree in terms of critical 
Poisson Galton- Watson trees. For example, 

g-i e~(™+i) 

P{r" is an m-star} = _ x (e^^)" = ■ — , 

m! m! 



P{T^ e T'} = 1 - ^ — X (1 - = 1 

i=0 

and 

P{T° e T"} = X (1 - e-^e^-""''). 

Therefore, 

oc 

limi^„({0}) > V (m - 1) X F{T° is a m-star} + P{r° e T", e T'} 

71 ' ^ 



m^2 



e-\l - (1 - e-i)e^ ) + 6^1(1 - e-'e'-'^ f 
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7. Largest eigenvalues: Proof of Theorem 14.121 

We first recall from Proposition l3.5r b) how Tn , the linear preferential attachment 
tree on n vertices with parameter a > — 1, can be constructed from a particular 
continuous-time branching process. 

Denote by Na = {Na{t))t>o a pure birth process that starts with a single progen- 
itor and when there have been k births a new birth occurs at rate k + I + a. Recall 
that J^(t) G T is the family tree at time t > of the continuous-time branching 
process in which the birth process of each individual is a copy of Na- Then, Tn has 
the same distribution as J-{Tn), where T„ := mi{t > : #J-{t) ~ n}. 

We now record some useful facts about the birth process Na- Recall the Malthu- 
sian rate of growth parameter "fa '■= a + 2. 

Lemma 7.1. (a) For any fixed time t > 0, the random variable V{No{t) ~ k} — 
(1 — e^*)'^e^*, fc = 0, 1, . . .. That is, No{t) is distributed as the number of failures 
until the first success in a sequence of independent Bernoulli trials with common 
success probability e^*. 
(b) For a > -1, set A := [a + 1] . Then, 

¥{Na{t) > Ke^} < Ae-^ 

for all K > and t>0. 

Proof. For part (a), note that A'o -I- 1 is a Yule process - the birth rate in state £ is 
£ - and the claimed distribution is well-known. 

To prove part (b), suppose that M ~ {AI{t))t>o is a Yule process started in 
state A (that is, M is pure birth process and the birth rate in state ^ is ^). Then, 
{M{t) — A)t>o is a pure birth process that starts in state and has birth rate 
£ + A>£+l + am state £. It is therefore possible to couple M and Na in such a 
way that Na{t) < M{t) — A for all i > 0. Observe that M has the same distribution 
as X]i=i(-^o + 1)' ^^"i M — A has the same distribution as ^o- We could 

prove (b) using the fact that M(t) is distributed as the number of trials before the 
A*** success in in a sequence of independent Bernoulli trials with common success 
probability e~*, but it is more straightforward to use a simple union bound. 

Observe that from part(a) and the inequality 1 — a; < exp(— x) that, for any 
C> 0, 

P{iVo(i) > Ce*} = (1 - e-*)LC'='J+i 

< (exp(-e-*))LC^^'J+i 
= exp(-e"*(LCe*J -f 1)) 

and hence, 

^ K 

n^akt) > Ke'} < Y^nNi^it) > ^e*} 

4=1 

K 

< Ae ^ . 



□ 
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Theorem 7.2. (a) There exists a random variable Za > such that 



lim — = Za almost surely. 




(h) There exists a constant C such that EyfJ-{t)] < Ce^'^*' . 
(c) For the random variable Za of part (a), 



lim T„ log n = — log Za almost surely. 

(d) There exists a random variable Wa > such that 



lim — — Wa almost surely. 

t — >oo 



Proof. Parts (a) and (b) (in a more general context) can be found in |Blia07j . so 
we shall not give the proof here. They essentially follow from the general theory of 
continuous time branching processes developed by Jagers and Ncrman. 

Part (c) follows immediately from part (a) and the relationship #J^(T„) — n. 

Turning to part (d), note that 



is a local martingale with bounded variation. Stochastic calculus shows that the 
process (e~* • {Na{t) + 1 + a))t>o is also a local martingale with bounded variation. 
The fact that the latter process is bounded in and hence, in particular, a true 
martingale follows from Lemma iT.lT b). 

It follows from the martingale convergence theorem that e^*Na{t) converges 
almost surely and in to a random variable Wa. 

It remains to show that Wa is strictly positive almost surely. Consider first the 
case a > 0. From a comparison of branching rates similar to that in the proof of 
Lemma mi it is possible to couple Na and A^o so that No{t) < Na{t) for alH > 0. 
Note that Wq has an exponential distribution with mean 1, and so Wa is certainly 
almost surely positive. 

Consider now the case — 1 < a < 0. Let Na be Na started in the initial state 
1 rather than 0, and put Na = Na ~ 1. Another comparison of branching rates 
shows that it is possible to couple Na and A^o so that N^it) < Na{t) for all t > 0. 
Thus, limj^oo e~*Na(t) is stochastically greater than the strictly positive random 
variable e~'^Wo, where the random variable r is independent of 14^0 and has the 
same distribution as the time taken for Na to go from to 1 (that is, r has an 
exponential distribution with rate 1 + a). □ 

Fix k > 1. Recall that A„,i > A„_2 > • • • > ^n,k are the k largest out-degrees 
in Tn (the out-degree of a vertex is its number of children) . We will show that the 
vertices with these out-degrees occur in a finite neighborhood about the root and 
that the out-degrees of vertices in a finite neighborhood about the root converge in 
distribution when properly normalized. 

Lemma 7.3. In the branching process construction of the linear preferential at- 
tachment tree Tn, let i denote the maximum out-degree in Tn among all vertices 
born before time S . Given any e > 0, there exists a finite constant such that 




liminf P{A: 



-11,1 — 



A„,i} > 1 - 
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Proof. Fix e > 0. Write An,s for the event 

There exists a vertex in Tn which was born after time S and has 
out- degree greater than the root. 

The claim of the lemma may be rephrased as a statement that there is a finite 
constant such that 

limsupP(A„,sJ < £■ 

n — >oo 

Note from Theorem 17. 2( 0*1 that there is a constant such that 



lim sup I 



Tn - — log n 

7a 



> bA < e/2. 



Set := logn — B^ and <" := ^ logn + B^. It is enough to prove that there 
exists a finite constant Sf, such that 

limsupPK^gJ <£/2, 

n — >oo 

where A'^ g is the event 

There exists a vertex born after time S that has out-degree greater 
than the root for some time t in the interval [t^ , ] . 
Furthermore, since the out-degrees of vertices increase with time, it is enough to 
show that 

limsupP«,sJ<e/2, 

n — >oo 

where A[[ g is the event 

There exists a vertex born after time after time S such that the 
out- degree of the vertex at time t" is greater than the out- degree of 
the root at time tl . 

For t > and a time interval / C [0, t] denote by Z{I, t) the maximum out-degree 
at time t of all vertices born in the time interval /. Let Cp(0 denote the out-degree 
of the root at time t. Note that 

<5 = {z([5,r;],f;)>Cp(r)}. 

Observe also, that for any constant 

nKs) <^{Cpm < K or Z{[S,tlUl) > K} 

< P{Cp(<-) < K}+F{Z{[S„tl],tl) > K}. 
It thus suffices to show that there is a sequence Kn and a constant Se such that 

(7.1) limsupP{Cp(r) < Kn} < £/4 

n — ^oo 

and 

(7.2) limsupP{Z([5,f;],f;) > if„} < e/4. 

n — *oo 

It follows from Theorem I7.2r d) that the inequality (|7.ip holds with Kn = 
X^ji^/io. fQj. suitable constant K^ > 0. 

Turning to the inequality (|7.2p . assume without loss of generality that S and t^ 
are integers. In that case, 

Z{S,tl)= max Z([m,TO-t- l],t"). 

S<ni<t^~l 
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Note that, by the union bound, 

P{Z([m, m + l],tl) > K„} < E[#^(m + 1)] V{Naitl - m) > K„}. 
Applying Theorem 17. 2r b') and Lemma FTlT b) gives 

ti-i 

where C" = K^/iAe^^). The inequahty foUows upon choosing 3 = 3^ large 
enough. □ 



A slightly more detailed analysis shows that Lemma 17.31 can be generalized to 
the k maximal out-degrees for any fixed k. Let A^^-^ > Af ^ > • • • > ^ be the 
k largest out-degrees in 7^ from among the vertices that are born before time 3, 
with the convention that A.f'" = for z > ^!F{3) when ^!F{3) < k. We leave the 
proof of the following result to the reader. 

Lemma 7.4. For any e > there exists a finite constant 3^ such that 
lim^inf P{(Af;i, . . . , Af^,) = (A„,i, . . . , A„,fc)} > 1 - e. 

Proposition 7.5. Fix 3 > and consider the marked tree T^f constructed by 
marking each vertex v of the tree J- {3) with n^^^^" D{v, n) £ R-^, where D{v, n) is 
the out-degree of v in%i. Then, T^f converges almost surely as n oo to the tree 
J- {3) equipped with a set of marks that are strictly positive almost surely. 

Proof. For any vertex v g ^{3), write Cvit) for the out-degree (that is, number 
of offspring) of v at time t, so that Cv{Tn) — D{v,n). Note that Cv{3) can be 
computed by only looking at ^{3) (recall that our trees are rooted). Conditional 
on J^{3), the processes := {Cv{3 + t) — (^y{3))t>o, v € ^{3), are conditionally 
independent. Note that the conditional distribution of (^y is that of a pure birth 
process that starts in state and has birth rate Cv{3) -I- ^ + 1 -I- a in state £. It 
follows from Theorem I7.2r d') that e~*Ci,(t) converges almost surely as t ^ oo to 
a random variable that has a conditional distribution which is that of the strictly 
positive random variable Wa+Q^(s)- Hence, by Theorem I7.2f c). 

lim e-(^"-^)C4r„ -S)= lim e^Z,n^i/^''C„(r„ - 3) 



exists almost surely and the limit is strictly positive almost surely. 

The result follows because n-^/^-D{v,n) = n-^/^-{Cy{3) + CviTn - 3}). 



□ 



Corollary 7.6. The random vector n ""^/^"(A;^ i, • ■ ■ , A^ j.) converges almost surely 
to a random vector {Yf, Y2 , ■ . ■ , Yjf ) as n — > cx), where Yf >Y2^ > ...> Y^ > 
almost surely. 

Completion of the proof of Theorem 14.121 Given Corollary 17.61 and 
Lemma 17. 4[ the proof is completed by applying the following elementary result 



with Xn,^ = n-i/^-An,, and Y^^ = n'^^-^'A 
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Lemma 7.7. Let (X„)J^j ~ {{Xn,i, ■ ■ ■ , Xn,k))?^=i be a sequence of M.'' -valued 
random variables. Suppose for each fixed e > that there exists a sequence o/M*^- 
valued random variables (yI'^'')^]^ = {{Y^ i, . . . on the same probability 

space such that 

liminf P{X„ ^ Y^f)} > 1 - e. 

n — >oo 

Suppose further that for each e > there exists a random vector Y^-* such that 
y1^' converges in probability to Y^^ as n —> oo. Then, there exists an MJ" -valued 
random variable Xqo such that X„ converges in probability to Xqo as n —^ oo. 

Proof. Convergence in probability for the space of R*^ -valued random variables is 
metrized by the metric r, where r(X, Y) := E[|X — Y| A 1] and where | • | denotes the 
Euclidean norm on W'. Moreover, this metric space is complete. By assumption, 

limsupr(Xm,X„) < limsupr(X„, Y^^^) 

m,?T, — '■oo rn^oQ 

+ lim sup r (Y(^) , Y(f ) ) + lim sup r (Y^,^) , X„) 

< 21imsupP{Y,^, 7^X„} 

n — >oo 

< 2e. 

The sequence (X„)5^]^ is thus Cauchy in the metric r, and hence it converges in 
probability to a limit Xoo • D 

8. An example 

Consider the following construction of a random finite graph Qn foi' a given 
positive integer n. The vertices of Gn are {l,...,2n}. Let £„i, be in- 

dependent, identically distributed random variables, with V{enk = 0} = g and 
P{£nfc = 1} = 1 ^ 9, where < q < 1. There is always an edge between the vertices 
2k — 1 and 2fc -|- 1 for 1 < fc < n — 1, and there is an edge between the vertices 
2fc — 1 and 2fc for 1 < fc < n if and only if e„fc = 1. There are no other edges. 

The graph t/„ consists of a large connected component 7^ with vertices 

{1, 3, . . . , 2n - 1} U {2fc : 1 < fc < n, e„k = 1} 

plus the (possibly empty) set of isolated points {2fc : 1 < fc < n, Snk = 0}. The 
graph 7^ is a tree. It resembles a comb with some of the teeth missing, see Figure ID 
No matter how we chose a root, the sequence of random finite trees (7^)^i 
would not converge in the probability fringe sense because the cardinality of the 
subtree below a uniformly chosen point would not converge in distribution to a 
finite random variable. However, if we look at the empirical distribution of the 
tree within graph distance fc of a vertex v as v ranges over the 7^, then it is not 
hard to see that this random measure converges to a deterministic limit for every fc. 
This observation and the fact that the vertices of 7^ have degree at most 3 shows 
that the moments of the spectral distribution of 7^ converge as n ^ oo to finite 
constants, and these constants are the moments of a distribution that is determined 
by its moments. Therefore, the spectral distribution of 7^ converges in distribution 
to a deterministic limit as n — > oo, and in order to compute the moments of that 
limit, it suffices to compute the limits of the expectations of the moments of the 
spectral distribution of 7^ 
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Figure 4. In the above graph Qn, every horizontal edge always 
occurs whereas every vertical (dotted) edge occurs independently 
with probability 1 — q. The comb tree 7^ is the unique connected 
component of Gn that is not an isolated point. 



Write Zn = #{1 < k < n : Snk = 0}. By permuting indices, it is possible 
to re-write the adjacency matrix of Gn in block form, where the upper-left block 
has dimensions (2n — Z„) x (2n — Z„) and is the adjacency matrix of 7^, while 
the lower-right block is the zero matrix of dimensions Zn x Z„. Therefore, if wc 
write Fn (respectively, for the empirical distribution of the eigenvalues of the 
adjacency matrix of 7^ (respectively, Gn), then 



where 6o is the unit point mass at 0. Since Zn/n converges in probability to q as 
n ^ oo, the limiting behavior of F„ is determined by that of Hn and vice-versa: 
Hn converges in probability to a non-random probability measure H and 



where the probability measure F is the limit of the sequence Fn- 

Define a random infinite graph G with vertex set Z as follows. Let Sk, k € Z, 
be independent, identically distributed random variables, with P{efc = 0} = g' and 
P{£fe = 1} = 1 — q. There is an edge between the vertices 2k — 1 and 2fc + 1 for 
all k G Z, and there is an edge between the vertices 2k — 1 and 2k for /c G Z if and 
only if £/j = 1. There are no other edges. 





32 



SHANKAR BHAMIDI, STEVEN N. EVANS, AND ARNAB SEN 



Let Bn (resp. B) denote the adjacency matrix of Gn (resp. G)- For each non- 
negative integer m, 



/ 



a;™ H{dx) 

= Urn / x'^Hnidx) 

n— >cso J 

Urn E Ix'^Hr.idx) 



: hm E 

n— >oo 



:E 



-(B' 
2^ 



111 



2^ 



)22 



(because Bij = for \i — j\ > 2, there is no problem defining B"^). 

Divide the matrix B into 2x2 blocks with the rows (resp. columns) of the (i, j)^^ 
block indexed by {2i + 1, 2?' + 2} (resp. {2j + 1, 2j + 2}). This block form matrix is 
block tridiagonal. The entries in the diagonals above and below the main diagonal 
are always the matrix 

'1 0^ 







We use this notation because we can think of Hx as the matrix for the orthogonal 

projection onto the x-axis in a two-dimensional {x, y) coordinate system. The entry 
in the (fc, k) diagonal block is the matrix 

£k+i\ 



Sk+l 



J ■ 



If Sk+i = 1, then this is the matrix 



1 

1 



that permutes the two coordinates. Otherwise, it is is the 2x2 zero matrix. By 
analogy with the definition of H^, set 

'0 0^ 



n„ 



1 



We have the following relations between these matrices 



n2 

X 


= n^ 




y 


= n^ 




UxUy 




= 




= I 










En, 


= n,s 




trlla; 


= trlly = 


1 


tr sn^ 


= trn^E 


= 


tr Y;Uy 


= trHyS 


= 0. 



A consequence of these relations is the following. 
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Lemma 8.1. For a > 0, 

trE" = 

For any r > 1, 

tr (s^^n^E^^n^ • • • E'*'-n^E"'-+i) 



if a is even, 
if a is odd. 



1, if ai + Or+i, a2, . . . ,ar are all even, 
0, otherwise. 



Proof. The first claim is obvious, because E" either the 2x2 identity matrix or E 

depending on whether a is even or odd. 

For the second claim, first observe that the product in question may be rewritten 

as 

E^in^E^^n^ • • • E^'-n^E'''-+i , 

where bg is or 1 depending on whether is even or odd. This in turn may be 
rewritten as 

where ze, 1 < i < r, is x or y depending on whether bi + ■ ■■ + be is even or odd, 
and c is or 1 depending on whether hi + ■ ■ • + 6^+1 is even or odd. 

The product is non-zero if and only ii zi = Z2 = ■ . ■ = Zr- This is equivalent to 
either 6i = and 62 = • • • = ^'r = 0, in which case the product is (n3:)''E'^ = n^E"^, 
or 61 = 1 and 62 = . . . = fe,- = 0, in which case the product is (nj,)''E'^ = IIjjE'^. 

Furthermore, even if the product is non-zero, and hence of the form Hx^'^ or 
IIj^E'^, the trace is zero if c = 1. Otherwise, the trace is 1. 

Thus, the trace is zero unless 62 = . . . = 5,- = and bi+- ■ ■+br+i is even, in which 
case the trace is 1. This condition is equivalent to a2, . . . , and ai + ■ — h a^+i all 
being even, and the result follows. □ 

In order to carry out the combinatorial calculations involved in computing the 
expected value of (5™)ii + (i3™)22; it will be helpful to recast them as computations 
involving a certain killed random walk on a random subinterval of the integers. 

Let Ui,U2,. ■ . be independent, identically distributed random variables that are 
independent of the random infinite matrix B and have common distribution 

¥{Uk = -1} = = 0} = ¥{Uk = +1} = ^. 
SetWm = Ui-\ \-Um (with Wq = 0). Note that 

^ V(B'")21 (S™)22, 

= S'^E [S1S2 • • • Eml{Wm = k a<We<(3,l<e<m}], 



where 



and 



a :~ max{fc < : Sk+i = 0}, 
P := min{fc > : Sk+i = 0} 

^ fn,, if[/^e{-i,+i}, 
■ I E, if Ue = 0. 
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Lemma 8.2. The random variables a and (3 satisfy 

P{/3- a = A^} = < 0, ifN=l, 
[(7V-l)(l-g)^-V, ^fN>2, 

and 

P{0-a=j|/3-« = 7V} = iV>2, 1 <j <iV-l. 

Proof. It is clear that 

P{/3 - a = 0} P{£i = 0} = g 

and P{/3 - a = 1} = 0. 

For TV > 2 and 1 < j < iV - 1, 
P{/3 -a = N, 

= P{£_j+i = 0, £-j+2 = ■ ■ ■ = £-j+N = 1, + = 0} 

and the remainder of the result follows. □ 

Consider the moment generating function ip{t) :— J2m=o 7^. I x"^ H{dx). From 
the above, (p{t) = (p{—t) for alH e M and 

exp(-3t)(^(t) = ifi [tr (S1S2 • • • Em)1{Wm = k a < We < (3, 1 < £ < M}] 

for t > 0, where M is a Poisson distributed random variable with expected value 3i 
that is independent of the random infinite matrix B and the increments J7i, C/2, ■ • ■■ 
Put I' := {1 < £ < M : Ui = ±1} and I" := {1 < £ < M : Ue = 0}. Then 
M' := ^T' and M" := #1" are independent Poisson distributed random variables 
with expected values 2t and t, respectively. Write T' = {£i,£2, ■ ■ ■ ,£m'}, with 
£1 < £2 < ■ ■ ■ < £m' ■ The set I" is the disjoint union of sets X",l2 j ■ • • j^^m'+I' 
where 

I'l ■.^{1<£<M :£< 4}, 
X'- :={!<£< M : £j-i <£< £j}, l<j< M' , 

and 

I'ir+i ■={l<i<M:£M' < £}■ 

Conditional on M' = m' , the random variables #1", , . . . , #X''j/^]^ are in- 
dependent and identically distributed with a common distribution that is Poisson 
with expected value t/{m' + 1). 

In view of Lemma |8. 11 the following well-known result will be useful. 

Lemma 8.3. The probability that a Poisson random variable with expected value 
A takes an even value is ^[l + exp(— 2A)]. 

Proof. The probability in question is 

_=exp(-A)-,_,.„ , 2 



A^^ 1 1 

^''''P^^^^T^fcV =''''P^"^)2[''''P(^)+''''P^"^)] = ^[l + exp(-2A)]. 

k=0 



□ 
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The conditional distribution of the sequence — Wo , We^ , • ■ • , We^^, given M' — 
m' is that of a symmetric simple random walk with m' steps, and we need the 
following result. 

Lemma 8.4. Fix an integer N > 2. Suppose that Sq, Si, S2, ■ ■ ■ is a symmetric 
simple random walk. Then 

N-l N-l / J, " 

= J & < 5, < TV, 1 < £ < n I 5o = j} = ^ (cos^ 
j=i k=i ^ 

for n — 0,1,2, . . .. This quantity is zero when n is odd. 

Proof. Consider the transition matrix P of a symmetric simple random walk killed 
on exiting {1, 2, . . . , N — 1}. That is, P is the {N — 1) x {N — 1) symmetric matrix 
with rows and columns both indexed by{l,2,...,Af — 1} and entries 

I 0, otherwise. 



It is a straightforward (and well-known) calculation that the eigenvalues of P 
:os^, 1 < fc < iV - 

(sin^j)f=l'- Hence, 



are cos^, 1 < fc < iV — 1, and the corresponding normalized eigenvectors are 



^ ^ ' /c7r\" 2 / fcTT \ / kn 



Thus, 



Af-l Af-1 . 7 \ n r, N-l , , \ 2 

j=i k=i ^ ' j=i ^ ' 

N-l 



E 



cos — 
Nj 



k=l 

as claimed. □ 

Observe from Lemma 18.11 that for t > the moment generating function ip of H 
satisfies 

exp(-3i)(^(i) = ilE [tr (S1S2 • • • Em)1{Wm = k a < We < (3, 1 < i < M}] 



2 



X 1{Wm =0 & a<Wi< I3,1<1<M} 

= i (2 X P {M' = 0, M" is even, & a < < ^ 

+ 1 X P{M' > 0, + #2^', . . . , are all even, 

Sm' =0, k a< Sk< I3,l<k< Af'}' 
where 5^ = W^. for 1 < A: < M'. 
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Conditioning on M' taking the even value 2h and (3 — a taking the value N , we 
hence, from Lemmas 18.31 and 18. 4| 

exp(-3i)v3(i) = exp(-2t)i [1 + exp(-2t)] (1 - q) 



^ OG 

-^exp(-2i) 



2h 



h=l 
1 



2h\ 



1 + exp 



£(7V-1)(1 



2h 

q) 



1 + exp 



2t 



N=2 



N-l 

N -I ^ 

k=l 



2h+\ 

2h 

I 



for t > 0. Thus, 



(1 — q) coshi 
2^ 2h\ 

h=l 

X cosh 



2t 



2h+\ 



cosh 

N-l 



t 



2h+\ 



2/1-1 



2h 



2h-l 



N=2 fc=l ^ 

for t > 0. The right-hand side thought of as a function of t G M is an even function, 
and it follows that the right-hand side is the correct expression for ip{t) for all t e M. 
Finally, if we write ■0 for the moment generating function of F, then 

fit) - f 



1 - ^ 

^ 2 
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